A Layered Architecture for Accelerating Matrix Computations

Author

  • A. Burdeniuk
Abstract

Many scientific and engineering applications are naturally expressed in terms of dense matrix or vector algorithms. This paper presents an FPGA-based matrix acceleration engine that improves the performance of dense matrix and vector algorithms. The engine offloads common data management tasks, normally performed by processing resources, onto a dedicated sequencer. The sequencer adopts an event-based parameter update mechanism, which gives it the flexibility to accelerate a wide range of matrix algorithms. An implementation of the sequencer has been developed and tested on a Xilinx Virtex-5 LX FPGA. This implementation uses 1856 slices and has an operating speed of 64 MHz, making it comparable to the OpenRISC CPU core to which it attaches.

Similar resources

Sparse Matrix Multiplication on CAM Based Accelerator

Sparse matrix multiplication is an important component of linear algebra computations. In this paper, an architecture based on Content Addressable Memory (CAM) and Resistive Content Addressable Memory (ReCAM) is proposed for accelerating sparse matrix by sparse vector and matrix multiplication in CSR format. Using functional simulation, we show that the proposed ReCAM-based accelerator exhibits...

A Method for Determination of the Fundamental Period of Layered Soil Profiles

In this study, a method is proposed to determine the fundamental period of layered soil profiles. The model treats the layered soil as a shear-type structure. First, the soil profile is divided into substructures. Then, the stiffness matrices of the substructures, considered as equivalent shear structures, are assembled according to the Finite Element Method. Thereinafter, the sti...

Implementation of Low-Cost Architecture for Control an Active Front End Rectifier

In AC-DC power conversion, active front end rectifiers offer several advantages over diode rectifiers such as bidirectional power flow capability, sinusoidal input currents and controllable power factor. A digital finite control set model predictive controller based on fixed-point computations of an active front end rectifier with unity displacement of input voltage and current to improve dynam...

An Improved Distance Matrix Computation Algorithm for Multicore Clusters

Distance matrices are used in many different research areas. Their computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The explosive growth of biological sequence databases has created an urgent need to accelerate these computations. The DistVect algorithm was introduced in the paper of Al-Neama et al. (in press) to present a ...

Synthesis of Control Software in a Layered Architecture from Hybrid Automata

This paper deals with the synthesis of control software for hybrid systems specified as hybrid automata. Instead of generating the software from scratch, the synthesis is based on a generic layered software architecture which supports both periodic and event-triggered computations. The use of the layered software architecture as the framework for implementing hybrid controllers is motivated in t...

Publication date: 2009